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ABSTRACT 

The implications of the massive, X-ray selected cluster of galaxies MS1054-03 at z = 0.83 are discussed 
in light of the hypothesis that the primordial density fluctuations may be nongaussian. We generalize 
the Press-Schechter (PS) formalism to the nongaussian case, and calculate the likelihood that a cluster 
as massive as MS1054 would have been found in the Einstein Medium Sensitvity Survey (EMSS). A 
fiat universe (£Im + = 1) is assumed and the mass fluctuation amplitude is normalized to the the 
present-day cluster abundance. The probability of finding an MS1054-like cluster then depends only on 
fiivi and the extent of primordial nongaussianity. We quantify the latter by adopting a specific functional 
form for the PDF, denoted which tends to Gaussianity for A ^ 1 but is significantly nongaussian for 
A,$ 10, and show how A is related to the more familiar statistic T, the probability of > 3er fluctuations 
for a given PDF relative to a Gaussian. Special attention is given to a careful calculation of the virial 
mass of MS1054 from the available X-ray temperature, galaxy velocity, and weak lensing data. 

We find that Gaussian initial density fluctuations are consistent with the data on MS1054 only if 
~ 0.2. For S7m > 0.25 a significant degree of nongaussianity is required, unless the mass of MS1054 has 
been substantially overestimated by X-ray and weak lensing data. The required amount of nongaussianity 
is a rapidly increasing function of f?M for 0.25 < < 0.45, with A < 1 (T£7) at the upper end of 
this range. For a fiducial dyi — 0.3, ft\ — 0.7 universe, favored by several lines of evidence (Wang et 
al. 1999), we obtain an upper limit A < 10, corresponding to a T > 3. This finding is consistent with 
the conclusions of Koyama, Soda, & Taruya (1999), who applied the generalized PS formalism to low 
(z^O.l) and intermediate (z^0.6) redshift cluster data sets. 



1. INTRODUCTION 

A working hypothesis in most approaches to cosmologi- 
cal structure formation is that the density fluctuation field 
<5(x) is Gaussian at early times. This assumption follows 
naturally from the idea that the primordial fluctuations 
may be described as superpositions of statistically inde- 
pendent Fourier modes, 



<5(x) 



5> k e ik ' x . 



(1) 



Such modes are generic predictions of inflation, in which 
quantum fluctuations are exponentially enlarged to be- 
come the classical density fluctuations. Whatever the form 
of the power spectrum P(k) — |<5k| 2 j the independence of 
the Fourier modes guarantees, by the central limit theo- 
rem, that the real-space density fluctuations are Gaussian. 

Although the Gaussian hypothesis is well-motivated, 
several lines of argument suggest that it merits further 
scrutiny. On the theoretical side, Peebles (1999a, b) has 
presented a model in which nongaussianity emerges natu- 
rally in a modified inflationary scenario. The key ingredi- 
ent is that the scalar field which drives inflation, ip, couples 
quadratically to a massive scalar field <p which, at the end 
of inflation, produces the cold dark matter (CDM). The 
CDM density field is then given by pcdm(x) = M 2( / ,2 ( x )/2, 
where 0(x) is a Gaussian random field. The fluctuations 
in the CDM component, <5(x) oc pcdm(x) — (pcdm), are 
thus distributed like a % 2 function shifted to have van- 
ishing mean. Such a distribution is strongly nongaussian, 
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with significant skew and excess kurtosis. (Other inflation- 
ary models which lead to nongaussian density fluctuations 
have recently been described by Salopek [1999] and by 
Martin, Riazuelo, & Sakellariadou [1999].) 

Peebles' model has been extended by White (1998) and 
by Koyama, Soda, & Taruya (1999), who have considered 
models in which the CDM is produced by a finite num- 
ber, m, of quadratically coupled fields. The resulting den- 
sity fluctuations would then have a \m distribution (again, 
shifted to have mean zero), which approaches Gaussianity 
in the limit of large m. In this way one can generate pri- 
mordial fluctuations with any desired amount of nongaus- 
sianity. However, the required number of quadratically 
coupled fields must be quite large ( £ 100) if the wanted 
amount of nongaussianity is small. Such a large number 
of inflationary fields arguably makes this model contrived. 

On the observational side, hints of nongaussianity have 
emerged, first, from Cosmic Background Radiation (CBR) 
anisotropy maps. Ferreira et al (1998, 1999; see also 
Magueijo 1999) analyzed the full-sky COBE-DMR maps 
(Bennett et al. 1996), finding evidence of nongaussianity 
on large ( <L 10°) angular scales using a statistic known as 
the normalized bispectrum estimator. This finding was 
confirmed by two independent analyses using alternative 
statistical techniques (Novikov, Feldman, & Shandarin, 
1998; Pando, Valls-Gabaud, & Fang, 1998). These studies 
appear to rule out Gaussianity of the CBR anisotropies 
at about the 95% confidence level. A curious feature of 
these COBE-DMR results is that the nongaussian signal 
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goes away if the north galactic cap is excluded from the 
analysis; this is a surprising property one that argues for 
caution in ascribing reality to these findings (see Bromley 
& Tegmark 1999 for a detailed treatment of this issue). In 
an unrelated study of CBR anisotropy detections on de- 
gree scales, Gaztahaga, Fosalba, & Elizalde (1998) found 
evidence for nongaussianity based on the large variance 
among the reported anisotropy amplitudes. This finding, 
like that of Ferriera et al, should be considered prelimi- 
nary, as it is based on an intercomparison of very different 
data sets and is strongly dependent on the accuracy of 
the reported errors. Nonetheless, these results from CBR 
anisotropy data call into question the hypothesis of pri- 
mordial gaussianity. 

The second line of observational evidence comes from 
the abundance and clustering of rich clusters of galaxies. 
As the most massive virialized systems in the universe, 
rich clusters are diagnostic of the background cosmologi- 
cal parameters in a number of ways (cf. Bahcall 1999 for 
a recent review). In particular, their number density and 
correlation length, as a function of mass and redshift, can 
be predicted from the Press-Schechter (1974, hereafter PS) 
formalism, summarized in § || below. The original PS 
formalism assumes primordial Gaussianity, but this as- 
sumption is readily relaxed, leading to a generalized PS 
approach (§ 2.3). Robinson, Gawiser, & Silk (1998) ap- 
plied the generalized PS formalism to low- redshift [z 0.1) 
cluster abundance and correlation data. They found the 
data to be consistent with the Gaussian hypothesis in an 
flyi — 1 universe. 2 For lower values of the density param- 
eter, however, Robinson et al. found that a substantial de- 
gree of primordial nongaussianity is required to reconcile 
the number density of massive clusters with their observed 
correlation length. Koyama, Soda, & Taruya (1999) fur- 
ther narrowed these constraints by applying the general- 
ized PS formalism to intermediate-redshift (z ~ 0.5-0.6) 
cluster abundance data, as well as to the low redshift data 
sets studied by Robinson et al. They found that S^m > 0.5 
was ruled out by the combined data sets regardless of the 
nature of the initial density fluctuations, and that Gaus- 
sian fluctuations were ruled out for J7m < 0.5. We defer a 
fuller discussion of their results to the final section of this 
paper, after introducing the necessary terminology in § |[ 

The studies cited above provide evidence that the hy- 
pothesis of Gaussian density fluctuations is violated at 
some level. The purpose of this paper is to extend the anal- 
yses of Robinson et al. (1998) and Koyama et al. (1999) 
by applying the generalized PS approach to a single high- 
redshift cluster, MS1054-03, which at z — 0.83 is one of 
the highest redshift clusters known. It is, moreover, the 
most extensively studied of the known high-redshift clus- 
ters; its mass has been accurately estimated by the tech- 
niques of weak gravitational lensing (Luppino & Kaiser 
1997), X-ray temperature analysis (Donahue et al. 1998), 
and galaxy velocity dispersion (Tran et al. 1999). These 
studies all found that MS 1054 is an unusually massive 
cluster, with a virial mass > 1O 15 /i _1 M . Its high red- 
shift, large and accurately determined mass, and its having 
been selected from the Einstein Medium Sensitivity Sur- 
vey (Henry et al. 1992; EMSS), whose selection criteria 



are very well understood, combine to make MS1054 espe- 
cially well suited for the present study. A previous analy- 
sis of MS1054 (Bahcall & Fan 1998) noted these features, 
and derived important cosmological constraints, but did 
so under the assumption of primordial Gaussianity. We 
will extend their results by including nongaussianity as an 
additional degree of freedom. 

This paper is also meant to serve two additional pur- 
poses. First, we will introduce (§ ||) a new mathemati- 
cal description of nongaussian density fluctuations, in the 
form of a probability distribution function (PDF) we refer 
to as the ^^-distribution. We will discuss the relation- 
ship between the ipx and Xm distributions, and argue that 
the former is perferable to the extent that a fully generic, 
model-independent parameterization of nongaussianity is 
desired. Second, we will present (§ ||) a detailed discussion 
of the proper determination of cluster virial masses from 
X-ray temperature, galaxy velocity dispersion, and weak 
lensing data sets, and apply the results to MS1054. This 
discussion will, it is hoped, be useful in clarifying the ways 
in which virial masses are dependent on cosmological pa- 
rameters and, more subtly, on cluster mass models, and 
thus inform future studies based on larger data sets. 

The outline of this paper is as follows. In § ^[ we de- 
scribe the Gaussian and generalized PS formalisms. In 
§ ||, we introduce the ip\ distribution. In § ^ we discuss 
the determination of cluster virial masses. In § |^ we use 
published data to estimate the virial mass of MS1054. In 
§ H we apply the generalized PS formalism to MS1054 and 
derive constraints on nongaussianity as a function of ^m- 
Finally, in § |t], we further discuss and summarize the main 
results of the paper. 

2. THE PRESS-SCHECHTER FORMALISM AND ITS 
EXTENSION TO THE NONGAUSSIAN CASE 

Under the assumption of primordial Gaussianity, the PS 
formula for the comoving number density of virialized ob- 
jects of mass M is 



/ r 2 p SJz) 

n(M,z ) = J-J?-J*± 

V 7T M Z OM 



din ctm 



dhxM 



-6 c (zf/2<r 2 M 



(2) 



where <7m is the rms mass fluctuation on a mass scale M, 
~p = flMPcrit is the comoving mean mass density, and S c (z) 
is the critical density for collapse at redshift z. Because 
<7m is by definition the rms density fluctuation linearly 
extrapolated to the present, 5 c (z) is similarly normalized 
to the present, 



S c (z; Qm, Oa) = 5q(z) 



d(z = o ; n M ,n A ) 



D(z;UmMa) 
The linear fluctuation growth factor is given by 

1 + z' 



E{zJ 



dz' . 



(3) 



(4) 



where E(z) = ^JVl M (l + z) 3 + n R (l + z) 2 + fi A , + 
+ ^A = 1) and we are following Peebles' (1993) no- 
tation. Throughout this paper we assume a flat universe, 
n R = 0. 

2 Here and throughout this paper, the present value of the mass density parameter is denoted S!m, while the density parameter associated with 
a cosmological constant A (or, equivalently, vacuum energy density) is denoted Q A . The Hubble constant is written as Hq = 100 h km s — 1 Mpc — 1 . 
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The quantity Sq(z) in equation ^ is the famous factor 
3(12tt) 2 / 3 /20 = 1.686 for an Einstein de-Sitter universe. 
It has a weak dependence on cosmological parameters and 
redshift. In this paper, we use the forms of 5q{z) derived 
by Kitayama & Suto (1996). 

2.1. Calculation of om 

The rms mass fluctuation om in equation [2] is computed 
from the linear power spectrum P(k) as follows: 



' M 



£°^A 2 (fc)^ 2 (fci?), 



(5) 



where A 2 (fc) = k 3 P(k)/2ir 2 is the mass variance per loga- 
rithmic wavenumber interval, W{kR) is the Fourier trans- 
form of the window function defining the mass scale M, 
which we take to be the usual top-hat, and R is the ra- 
dius of a sphere which contains mass M in an unperturbed 
universe. In this paper we assume a CDM-dominated uni- 
verse, in which the power spectrum is well-approximated 

by 

3+n 



A 2 (k) = s 2 H (n 



M 



T 2 o/r) 



(6) 



(Bunn & White 1997), where Sh is the rms overdensity at 
horizon-crossing, n is the primordial spectral index, and 
T(q) is the CDM transfer function, for which we adopt the 
analytic approximation of Bardeen et al. (1986). The pa- 
rameter r determines the position of the power-spectrum 
"turnover" in fc-space. For CDM it is given by T w ttyih. 
For the calculations of this paper, the precise value of T is 
relatively unimportant, and we adopt the value T — 0.20, 
consistent with observations of large-scale structure data 
(e.g., Liddle et al. 1996) and with the CDM expectation for 
reasonable values of the cosmological parameters. We also 
assume n = 1, consistent with estimates derived from the 
large-scale CBR anisotropies observed by COBE (Gorski 
et al. 1996). Changing our adopted values of T and n 
by ,$ 20%, roughly their allowed ranges, would have little 
effect on the main conclusions of this paper. 

2.2. Conversion to useful units 

To make practical calculations, we reexpress the PS for- 
mula in terms of suitably scaled variables. We first define 
a dimensionless mass m by 



47T o 

= ~3 r 8 Pcrit m 



5.95 x lO 14 ™^ 1 M Q , (7) 



where r§ = 8/z." 1 Mpc and p cr it — 3Hq/8itG is the crit- 
ical density. For rich clusters, m is of order unity. Let 
n(m)dm be the comoving number density of clusters with 
dimensionless masses in the range (m, m + dm). Then 



n(m, z) — n(M, z) 



dM 
dm 



2 3f^M S c {z) 
tt Air r| m 2 om 



din om 



dhxM 



(8) 



where we have substituted the definition of m into equa- 
tion 1. Furthermore, as noted above, we compute om not 



as a function of mass but of a length scale R. n 
M = ^irR? m n M p crU , or equivalently, 



Pm 

Thus, d In uuld In M - 
ing this into equation | 
yields 



'8 



m 



M 



1/3 



defined by 



(9) 



1/3 x d In CTM/d In R. Substitut- 
and evaluating numerical factors 



n(m, z) = n 



n 



M 



d In om 



d\nR 



v m <t>{v m ), (10) 



where n = 3.11 x 10 _4 (/i _1 Mpc) -3 , <j>{x) is a Gaus- 
sian of zero mean and unit variance, and v m (z) = 
S c (z; £lM)/<7M{Rm)- Equation ^0| gives the comoving num- 
ber density per unit dimensionless mass; note that most 
of the f^M-dependence of this expression resides in the fac- 
tor u m , not in the out in front. Finally, the directly 
observed quantity is N{> m, z), the comoving number den- 
sity of all clusters of mass > m at redshift z, given by 



N(> m, z) = uq O 



M 



00 (W 
, m' 2 



din om 



d\nR 



v m i(t>{v m <) . 

(11) 



2.3. Extension to Nongaussian Fluctuations 

The above formulae assumed a Gaussian PDF. We now 
relax this assumption, and assume only that the PDF, 
P(S\M), is such that (6) = and that (S 2 ) = o 2 M . It 
is useful to express the PDF in terms of a dimensionless 
function ip(x) as follows: 



P(5\M) = o M x ^ 



_S_ 

CTM 



(12) 



where ip(x) satisfies the conditions J ij)(x)dx = 1, 
/ x ip{x) dx — 0, and J x 2 ip(x) dx = 1 . 

If one now retraces the usual steps leading to the PS 
abundance formula, equation ^, but using the PDF given 
by equation [l2] in place of the usual Gaussian, one arrives 
at the expression 



n(M, z) 



M 2 



din o M 



d\nM 



TP 



S c (z) 
om 



(13) 



The quantity is given by (2 ip(x) dx) ; it corrects 
for underdense regions that are incorporated into virializcd 
structures according to the standard PS ansatz. Equa- 
tion [l3| is our generalization of the PS formalism for non- 
Gaussian density perturbations. Setting ip(x) to a Gaus- 
sian yields equation 1, as may readily be verified. Adopt- 
ing dimensionless mass units and repeating the steps of 
§ 2.2, we obtain for N(>m, z) the result 



N(>m,z) = f^no f2 M 



r°° dm' 


dlnoM 


Jm m' 2 


dlnR 



(14) 

where the meaning of no and v m are the same as above. 
The difference between the standard and generalized PS 
abundance predictions thus consists simply in replacing 
the Gaussian PDF <\> by the function ip, apart for the fac- 
tor f^,, which as shown below is ~ 1 for cases of interest. 
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3. DESCRIPTION OF NONGAUSSIANITY 

We wish to consider non-Gaussian fluctuations generi- 
cally, i.e., to construct a PDF meeting the above conditions 
but which is not tied to a particular model. Moreover, we 
want our distribution to contain a parameter which quan- 
tifies the degree of nongaussianity. As noted above, a pos- 
sible choice is the Xm distribution, a generalization of the 
Peebles (1999a, b) model proposed by White (1998) and 
by Koyama et al. (1999). Here we suggest an alternative 
parameterization. Our reasons for doing so are twofold: 
first, the \m distribution is associated with a particular 
physical model, whereas our preference is to avoid such 
association at this preliminary stage in our understanding 
of the primordial fluctuations; and second, the Xm model 
approaches gaussianity only in the limit of very large to, 
a fact which makes the physical interpretation difficult to 
sustain if the observationally required degree of nongaus- 
sianity is slight . 

Our model is based on a modified form of the Poisson 
distribution. Consider, first, an integer random variable n 
that is Poisson-distributed with expectation value A. The 
probability that n takes on a particular integer value m is 




-4 -2 2 4 -4 -2 2 

X = (5/ O X = (5/ (7 



P(n = to) 



(15) 



The rms deviation of n is y/X. We now imagine that n 
is continuous rather than discrete, and define a related 
random variable x = (n — X)/y/X. The probability dis- 
tribution of x, which we refer to as ip\(x), is a modified 
form of the Poisson distribution, but shifted and scaled so 
that it has mean zero and unit variance. We obtain an 
explicit representation of ip\(x) by letting to = yfXx + A 
in equation [l5|, and then multiplying by \f\~ to renormal- 
ize the distribution. In addition, the factorial function in 
the denominator, which is defined only for integer values 
of its argument, must be replaced by its appropriate gen- 
eralization for continuous variables, the T-function. This 
yields 



r(VAir + a + i) 



(16) 



Fig. 1. — Comparison of a Gaussian, <j> (thip_red curve) with the 
distribution (heavy blue curve), equation ha, for four values of 
the parameter A. Note the progression of tp\ toward Gaussianity for 
A»l. 

Figure [j] compares the ip\ distribution with a Gaussian 
for four values of the parameter A. For A = 3, the differ- 
ences between the two are readily apparent; in particular, 
one sees the significant skewness of the tpx distribution. 
For A = 5, analogous differences can be seen, but they are 
noticeably smaller, and for A = 10 the differences are very 
small. For A = 100, the ip\ 1S indistinguishable, on this 
plot, from a Gaussian. 

The differences between ip\ and a Gaussian, <f>, become 
more apparent, even for A ^ 10, if we examine their behav- 
ior for large values oiS/a. This is shown in Figure || for the 
same four values of A as in the previous figure. We see that 
for A ^,10, ip\(x) can exceed </>(x) by 2-3 or more orders 
of magnitude for 5 /a 4. As noted in § ||, this same factor 
enters directly into the PS-predicted cluster abundance. 
Thus, we expect that if the PDF is described by the ip\ 
distribution, massive clusters, representing rare peaks in 
the initial density field, will be far more abundant than 
they will in the Gaussian case, provided A is not too large. 



This function is defined for x > — (VA + 1/vA). As we 
now show, ip\(x) is a suitable representation of quantifi- 
able, generic departures from Gaussianity. 3 

3 We note that equation [u^ does not guarantee that t/>x[x) will have the key properties we seek: normalization, vanishing mean, and unit 
variance. For small A, the transition from integer to continuous arguments does not preserve these properties, which were inherent in the 
parent Poisson distribution. By direct integration we have found that departures from these properties are completely negligible for A > 10. 
For A < 10, we correct ^\(x) using formulae derived by fitting the deviations from. normalization, vanishing mean, and unit variance. However, 
these corrections are extremely small for the cases of interest, so that equation hq is fully adequate for our purposes. 
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x = 5/a 



Fig. 2. — The ratio of if>x to a Gaussian, plotted for values 
of the argument x > 1. The plotted ratio shows that "rare events," 
S/a 3, are considerably more likely if the distribution of overdensi- 
ties is described by the ip\ function for A ^ 10, than they are in the 
Gaussian case. 
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Fig. 3. — The T statistics of the (solid line) and \m distribu- 
tions (open circles) plotted versus their respective parameters, A and 
m. The axes are scaled such that m = 9A. The close agreement of the 
T statistics for the two distributions reveals their strong similarity in 
terms of the likelihood of rare ( Sj 3<j) events, if one chooses m = 9A. 



The factor that appears in equation [13] may be calcu- 
lated numerically for the tpx distribution. We have done so 
for 2 < A < 300. Within this range the numerical results 
are very well approximated by the expression 



U(X) = 1 + 0.149A 



-0.528 



(17) 



Note that differs from unity by less than 10% for A ^, 2. 

3.1. Comparison with the \m distribution 

The model introduced by White (1998) and by Koyama, 
Soda, & Taruya (1999) postulates that the CDM density 
field is given by 



Pcdm{*) 



i=l 



0?(x) 



(18) 



where the </>i(x) are statistically independent Gaussian 
fields. The overdensity v — S/a in this model is distributed 
like a \ 2 variable with m degrees of freedom, shifted to 
have vanishing mean and unit variance. Explicitly, 



P{v)dv = 



1 



m/2-l 



2 \(ro-l)/2 



(-) 



r (f) 



exp(- y 



(19) 



a distribution we henceforth label \m- 

Koyama, Soda, & Taruya (1999) (see also Robinson, 
Gawiser, & Silk 1998) have advocated quantifying "rare 
event" nongaussianity in terms of a "T-statistic" defined 

b y' _ 

' P{v)dv 



T = 



Is 



'^/ 2 du 



(20) 



the likelihood relative to Gaussian of 3cr or rarer events. 
In terms of this statistic the relationship between the 
and distributions becomes particularly clear. In Fig- 
ure ||, the T statistics for the two distributions are plotted 
versus their respective parameters, with the axes scaled 
such that to = 9A. With this scaling, the two distributions 
have T statistics which agree to within a few percent for 
all A. This shows that in terms of the likelihood of rare 
events, the ip\ and \m distributions are extremely similar 
for m = 9A. It is also interesting to note that, despite the 
near-indistinguishability of ip\ from a Gaussian for A ^ 100 
(Figure 1), the two distributions approach the Gaussian 
value T — 1 extremely slowly as A — > oo. Indeed, one re- 
quires 777 ~ 200 even to achieve T = 2, twice the Gaussian 
value. It is for this reason that the \m model may not be 
suitable for describing mild Gaussianity — at least if one 
were to take its physical basis seriously — because the re- 
quired number of primordial Gaussian fields is excessively 
large. 

4. ON THE DETERMINATION OF CLUSTER VIRIAL 
MASSES 

It is essential when applying PS abundance esti- 
mates that one use the rigorous definition of virial 
mass. Specifically, the PS formulae apply to the mass 
My interior to a radius ry such that ZMy/Airr v = 
Ay(z, f^M, ^a)p(z), where Ay(z, Dm, ^a) is a cosmology- 
and redshift-dependent overdensity factor. In an Einstein- 
de Sitter universe, Ay ~ 178 at all redshifts; for fi M < 1, 
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Ay is larger than this value, and increases with decreasing 
redshift. Kitayama & Suto (1996) have derived analytic 
approximations for Ay(z) as a function of CIm for flat and 
open cosmologies, and we use their formulae in this paper. 

The definition of virial mass above means that one can- 
not compute My from observational data, such as velocity 
dispersion, X-ray temperature, or weak lensing, without 
specifying both a cosmology (CIm and and a model for 
the radial mass distribution of the cluster. The X-ray or 
velocity data are usually derived from the dense, central 
parts of the cluster (r^S500/i _1 kpc), whereas the virial 
radius is generally in the range 1— 1.5ft, -1 Mpc for massive 
clusters. Consequently, an extrapolation is entailed. Weak 
lensing data do yield mass as a function of aperture that, 
in the case of MS1054, extend nearly to the virial radius, 
but the mass in question is a projected mass density, so 
that, again, a model is required to obtain the virial mass. 
These issues are not always well appreciated; in this sec- 
tion we derive My in the context of a specific mass model 
in order to make clear the assumptions involved. 

We adopt the profile of Navarro, Frenk, & White (1997; 
NFW), which has been shown to be a good fit to cluster- 
scale dark matter halos in N-body distributions. The 
NFW profile has a density distribution given by 



p(r) 



Pcrit $c 

x(l + x) 2 



(21) 



where x = r/r s and r s is a scale radius for the halo. The 
central density is determined by the parameter S c , which 
one expects to be of order 10 4 for massive clusters. The 
mass interior to radius r is then given by 



M(r) = An p crit 5 C r 3 s 



ln(l 



1 + x 



(22) 



Substituting equation^ into the expression for virial mass 
yields 



ln(l + x v ) 



Xy 



Ay(l 



■n 



M 



3 6 c 



(23) 



where xy = ry jr s 



4.1. Obtaining My from X-ray temperature or velocity 

dispersion 

Now let us suppose that the observational data consist 
either of a measurement of the X-ray temperature Tx of 
the intracluster gas, or of the rms line-of-sight velocity dis- 
persion of cluster galaxies, a v . Applying the equation of 
hydrostatic equilibrium to the former, or the Jeans equa- 
tion to the latter, leads to the equation 



GM(r) 



•Px,g = [PX,gU 



(24) 



where px,g is the density of the X-ray emitting gas or the 
galaxy population at radius r, and 



kT x 
fj, m p 



X - ray gas ; 
galaxy velocities . 



(25) 



We can solve equation 24 if we make the simplifying 



assumption that the tracer(gas or galaxies) distribution 



is roughly isothermal at the radii from which most of the 
X-ray emission (galaxy velocities) are derived, r ~ r s . I.e., 
we assume that din px, ga s / dlnr rs —2, u 2 rs constant. 
This assumption is reasonable if the gas (galaxies) approx- 
imately trace the dark matter potential, because the NFW 
profile itself is nearly isothermal at such radii. With the 
isothermal assumption we obtain from equation ^ the re- 
lation 
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H r s 



(26) 




a 
o 
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Fig. 4. — The virial mass (upper left panel), virial radius (lower 
left panel), NFW concentration parameter (upper right panel), and 
dimensionless central density <5 C (lower right panel), for a cluster 
with an X-ray temperature of 12.3 keV, plotted as a function of the 
adopted value of the NFW scale radius r s . See text for details. 

Equation ^6] shows that S c is determined from observa- 
tions (i.e., Tx or o~ v ) only if the scale radius r s is known. 
In other words, the two NFW parameters cannot be de- 
termined from a single piece of information. Nevertheless, 
one might hope that the virial mass itself is insensitive to 
the particular values of 8 C and r s provided they are related 
by equation]^. This indeed turns out to be the case, as we 
show by adopting the following procedure. First, we pick 
a value of r s in the range 250-1100/i _1 kpc. From this 
we derive 5 C from equation ^6| and insert the result into 
equation ^3| to obtain xy. Substituting into equation |22] 
then yields, after some algebra, the result 



21 « 2 
My = --r s 



\n{l + xy) 



Xy 



1 + Xy 



(27) 



Several aspects of equation ^ merit further comment. 
First, xy is a function of the observable u and the adopted 
value of r s . Thus, My is not quadratic in u and linear in 
r s . Indeed, the virial mass is virtually independent of the 
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adopted value of r s , as was argued above. This is shown in 
the upper left panel of Figure [|, in which the My is com- 
puted from an X-ray temperature of Tx = 12.3 keV, the 
value obtained by Donahue et al. (1998) for MS1054 (cf. 
§ g). (The fi gure assumes a fiat, Qm = 0.3 cosmology at 
z = 0.83, the rcdshift of MS1054.) For 300 < r s < 1000ft- 1 
kpc, My varies by less than ~ 10% from its mean value of 
1.3 x 10 15 h~ l M Q for the adopted T x . The virial radius r v 
changes by an even smaller percentage over this range. In 
contrast, the inferred central density S c is quite sensitive to 
the adopted r s . To constrain r s further, the NFW concen- 
tration index c is plotted in the upper right hand panel. 
(This quantity is not simply equal to ry/r s , as it is in 
Navarro et al, because the virial overdensity is not taken 
to be exactly 200.) For massive clusters, the simulations 
of Navarro et al. suggest that c ~ 3-5. Thus, the plausible 
values of r s are in the range 300;$ r s 700ft. -1 kpc. As 
we have seen, My varies little for r s in this range. Conse- 
quently, we can be assured that our procedure introduces 
less than about 10% error in the virial mass. 

Second, the virial mass as determined from equation 
differs from what would be obtained had we assumed (as 
is usually done) a purely isothermal structure for the clus- 
ter. Specifically, the virial mass obtained from the NFW 
model is 10-20% larger than an isothermal mass estimate, 
with the precise factor depending on the typical concen- 
tration index. In a sense it is reassuring that the difference 
is relatively small; on the other hand, as we shall see be- 
low, even small mass differences can be crucial in terms of 
inferences about nongaussianity. Constraining the mass 
profiles of clusters is thus an important ongoing task for 
cluster physics. 

Last, it is im po rtant to bear in mind that My as derived 
from equation is dependent on the adopted cosmology 
and redshift, via equation |2^, from which xy is derived. 
Thus, the virial mass of a cluster, derived from tempera- 
ture or velocity observations, cannot be specified indepen- 
dently of adopted cosmology. This property of the virial 
mass, as defined by the PS formalism, will be important 
in the analysis presented below. 

4.2. Obtaining My from weak lensing data 

Weak lensing data constitute the third and, in principle, 
most rigorous means of deriving cluster masses, as there is 
no need to assume hydrostatic or dynamical equilibrium. 
Deriving a virial mass given aperture mass measurements 
nonetheless requires care, as we now discuss. 

The quantity measured directly by a weak lensing shear 
analysis (e.g., Luppino & Kaiser 1996) is the mean dimen- 
sionless surface density k within an angular radius 9 of the 
cluster center. The projected mass within 9 is then given 

by 



M(9) 



DsDl -«o> 



(28) 



4irG D LS 

where Dl, Ds, and Dls are the lens, source, and lens- 
source angular diameter distances respectively. These 
angular diameter distances are all cosmology-dependent; 
moreover, the typical redshift of the lensed sources must 
be assumed. These factors all influence the derived virial 



mass. Evaluating numerical factors in equation 28 yields 



M{9) = 2.13x10 



15 



K 

or V4 



VLVS 



(1 + z L )y LS 



-h~ 1 M Q , (29) 



where the y's are the dimensionless comoving angular di- 
ameter distances as defined by Peebles (1993). 

To make the connection with the NFW halo parame- 
ters, we must equate the observationally derived aperture 
mass above with the projected mass calculated from the 
NFW formulae; we show elsewhere (Willick & Padmanab- 
han 1999) that this is given by 



M{9; 5 C , r.) = 4w Pcrit 6 c r 3 s f3(D L 9/r s ) . 



(30) 



where 



X poo 



JO 



0{x)= ydy (y 2 + w 2 )-^ 2 {l + y 2 + w 2 )- 1 dw (31) 



is the projected (dimensionless) mass density for the NFW 
profile. (Explicit expressions for (3 are given by Willick & 
Padmanabhan 1999.) Equating the lensing-inferred aper- 
ture mass (equation |29| ) with the NFW model aperture 
mass (equation |30| ) yields the following expression for S c 
as a function of r=, k, and 9 : 



4886- 



0.1 V4' 



[3- 1 



-3 



500 kp C y (l + z L )y LS 



(32) 



O 1-5 - 





400 600 800 1000 

r_ (h _1 kpc) 



400 600 800 1000 

r g (h _1 kpc) 



a 
ft 



1560 



520 




400 600 800 1000 

r s (h _1 kpc) 



400 600 800 1000 

r s (hr 1 kpc) 



Fig. 5. — Same as the previous figure, except that now the virial 
mass and related quantities are derived from lensing data. A mean 
convergence of k = 0.08 within 8 = 4', the values for MS1054, have 
been used in the calculation. 

Equation |3| is the analogue of equation ^6| for the case 
of a lensing measurement; it gives d c from the data for 
any adopted value of r s . This suggests that we proceed 
as above: pick a value of r s within a sensible range; cal- 
culate the corresponding 6 C from equation ^2[ and thus 
the virial radius in units of r Sl xy, from equation |2^; ob- 
tain the virial mass My — 4nr v Ayp/3. We expect that, 
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as found in the temperature/ velocity dispersion case, the 
virial mass will be insensitive to the choice of r s . Figure |[ 
in which virial mass, radius, NFW concentration index, 
and central density are plotted for the lensing parameters 
of MS1054 (see next section), shows that this is once again 
the case. 

5. THE VIRIAL MASS OF MS1054 

We now apply the methods of the previous section to es- 
timate the virial mass of MS1054 as a function of SIm- We 
assume a flat universe, J7m + = 1, so that the quantity 
Av(z) discussed in the previous section is fully determined 
by fl M - 

MS1054 lies at a redshift of z = 0.833 (Tran et al. 1999, 
T99), making it one of the most distant confirmed rich 
clusters. Its mass can be estimated via each of the three 
principal methods of the previous section. T99 found 
it to have a velocity dispersion of 1170 ± 150 kms -1 
based on 24 galaxies. An earlier determination of the 
velocity dispersion by Donahue et al. (1998; D98) found 
<r v = 1360 kms^ 1 based on 12 galaxies. D98 also ob- 
tained ASCA data for MS1054, from which they deduced 
a temperature of Tx = 12.3^2 2 keV for the intraclus- 
ter gas, equivalent to a velocity dispersion kTx/f^m p = 
(1413 kms^ 1 ) 2 for /i = 0.59, the value adopted by Borgani 
et al. (1999). Luppino & Kaiser (1997, LK97) obtained V 
and / band images of MS1054 at the 2.2 m telescope on 
Mauna Kea in good (~ 1") seeing, and used these data 
to estimate k(9). Their mass estimates (for which they as- 
sumed D,m — 1) indicate a 7t = 0.1 for 9 = 4 arcmin. How- 
ever, a more conservative estimate, obtained from their 
Figure 7, is k = 0.08 for 9 = 4', and we use this more 
conservative estimate here. 

Using these data we can determine the virial mass of 
MS1054 for an adopted cosmology. Because of the ~ 10% 
variations in the deduced mass with the adopted value of 
r s , we must impose an additional constraint. The most 
reasonable one to adopt is to require the NFW concen- 
tration parameter c to take on a particular value. Here 
we choose c = 4. For the lensing mass, an additional as- 
sumption is required, namely, the mean redshift, z$, of the 
lensed galaxy population. Here we choose zs — 1.5, which 
LK97 considered the most suitable value, and which yields 
mass estimates midway between the minimum (zs ~ 1) 
and maximum (25 w 3) allowed values (LK97). 

Figure ^ shows the virial masses obtained via each of 
the three methods as a function of flyi- The cosmologi- 
cal dependence of the virial mass is clear. We have also 
calculated, for use in the next section, a weighted average 
mass and the 1 a errors on this average. Mass errors were 
estimated by considering the principal sources of error for 
each method: velocity dispersion uncertainty for the dy- 
namical mass, temperature uncertainty for the X-ray mass, 
and source redshift uncertainty for the weak lensing mass. 
The corresponding mass errors were estimated to be 42%, 
35%, and 40% respectively. The individual masses were 
weighted inversely with the squares of the fractional mass 
errors to obtain the average mass, which is shown as a 
heavy solid curve in the plot; the shaded region around 
the average shows its ±1 a uncertainty, which is ~ 22%. 

As the figure shows, while the X-ray and lensing masses 
agree to within ~ 10% for Om ~ 0.5, the dynamical mass 
is 40-50% smaller than the other two. This difference is 



within the observational errors, and thus is not indicative 
of systematic differences. Nonetheless, the sensitivity of 
the PS-predicted cluster abundance to mass makes this 
difference potentially significant. Wc discuss this impor- 
tant issue further in § 7.2 

2.5 |— 1 1 I 1 1 1 I 1 1 1 I 1 1 1 I 1 1 1 I 
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Fig. 6. — The virial mass of MS1054 computed from X-ray, galaxy 
velocity, and lensing data, plotted as a function of Qm - The weighted 
average is plotted as a solid black line, with the light shading showing 
the ±1 a errors. 



6. METHOD OF ANALYSIS AND BASIC RESULTS 

In order to apply the PS formalism to high-rcdshift 
clusters, one must first normalize the mass fluctuations 
on cluster scales by requring the PS prediction to match 
observations at low redshift. The result is generally ex- 
pressed as a relationship between erg, the value of <jm 
within a top-hat sphere of radius 8/i _1 Mpc, and Om- 
Recent results obtained from X-ray temperature and lu- 
minosity data include those of Borgani et al. (1999): 

a s = (0.58 ± 0.06) x n^ QA7+0 - 16nM ; (33) 
Wang & Steinhardt (1998): 

cr 8 = (0.50 ± 0.10) x O; M 0-43-0.33fiM ■ (34) 
and Pen (1998): 

cr 8 = (0.53 ± 0.05) x ^M 53 • (35) 

(In each case, results for a flat universe, £Im + = 1, 
are given; in the case of Wang & Steinhardt (1998) nom- 
inal values of other cosmological parameters have been 
adopted. See the original papers for further details.) Sim- 
ilar expressions have been given by Eke, Cole, & Frenk 
(1996), Girardi et al. (1998), and Suto et al. (1999). 
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Fig. 7. — Three recently published o^-Hm relations derived from 
low-redshift cluster X-ray temperature and luminosity data. 

Fi gur e M s hows a « as a function of Qm as given by equa- 
tions B3L p4| , and p5l The agreement is within the reported 
errors, and is especially good for 0.4 ^ 0.7. However, 
because of the exponential sensitivity of the PS abundance 
predictions to o~m , the small differences in figure [?] have a 
nonncgligiblc effect on the inferred degree of nongaussian- 
ity In what follows, we adopt the Borgani et al. (1999) 
calibration, which is intermediate between the other two 
for f^M in the observationally preferred range (~ 0.2-0.5). 
Furthermore, it is based on the most extensive and recent 
compilation of X-ray data, on which the conclusions of the 
present paper heavily depend. 



6.1. Effect of nongaussianity on the crg-fiM relation 

The above low-redshift os-f^M relations were, of course, 
derived under the assumption of Gaussian primordial den- 
sity perturbations. If we relax this assumption — e.g., as- 
sume the fluctuations are described by the if>\ distribu- 
tion for finite A — we expect the og-fiM relation to change 
as well. To obtain the relation for finite A we first note 
that the relations essentially represent the requirement 
that the z = PS abundance prediction, for any Qm, 
match the observed abundances at a characteristic rich 
cluster mass. Specifically, the Borgani et al. (1999) cal- 
ibration yields Mn(M) = 4.2 x 10~ 6 h~ x Mpc -3 for 
M = 5.95 x 10 14 ft, _1 M o (i.e., for m = 1). By requir- 
ing that this condition hold for finite A as well, we obtain 
modified os-f^M relations for any desired degree of non- 
gaussianity. 



Fig. 8. — The ratio of erg for the i/>^ distribution to the Bor- 
gani et al. value of eg, plotted as a function of Si^. The values of 
(78(A) were obtained by requiring the generalized PS abundance of 
M = 6 X 10 14 ft. -1 Mq clusters to match that obtained for the Gaus- 
sian case, at each value of Cm- 

Figure || shows the effect on the a$-Q,M relation of us- 
ing the generalized PS abundance formula, equation [l3[ 
with the ip\ distribution for A = 3, 10, and 100. It can 
be seen that the changes relative to the Gaussian case are 
rather small, especially for S7m < 0.2. This is because low- 
redshift clusters of moderate mass do not represent very 
high peaks in the initial density field, for low fijc For sig- 
nificant nongaussianity A 3 and J7m ^ 0.4, the fluctuation 
normalization is more subtantially modified. 

6.2. Calculating the expected number of MS 10^3 -like 

clusters 

Once we have modified the normalization of the density 
fluctuations ctm for a given value of A as described above, 
we may calculate the predicted comoving number density 
of clusters above a given mass threshold using equation [lj 
with tp = When Gaussian fluctuations are assumed, 
we use equation |ll|. To determine the number of clus- 
ters, of mass and redshift as large or larger than that of 
MS1054, expected in the EMSS sample, we integrate this 
number density over redshift, multiplying by the appriate 
comoving volume element: 

f Xma * dV 
Nexp = W1054 J N(>m W5i ,z)—dz , (36) 

where: 

1. mio54 is the virial mass, in dimensionless units (cf. 
§ 2.2), of MS1054 as determined by the methods of 
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2. ojio54 is the solid angle covered by the Einstein satel- 
lite to a limiting X-ray flux fainter than the observed 
flux of MS1054. From Tables 1 and 2 of Henrey et 
al. (1992) we find uj 1054 = 0.041 sterradian; 

3. zi = 0.833 is the redshift of MS1054 (T99); 

4. z max is the maximum redshift out to which a cluster 
whose X-ray luminosity is equal to that of MS 1054 
could have been detected by the EMSS. In prac- 
tice, N{> to, z) is dropping so rapidly with increas- 
ing redshift that the result is insensitive to z max for 
Zmax ~ 1) and we conservatively set z max = 1.3 in 
the calculations to follow; 

5. dV/dz is the cosmology-dependent comoving volume 
per unit redshift (e.g., equation 13.61 of Peebles 
1993). 
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Fig. 9.— Expected number of MS1054-like clusters in the EMSS 
survey, plotted as a function of mass, under the assumption of Gaus- 
sian initial density fluctuations. The trg-ClM relation is that of Bor- 
gani et al. (1999), and a flat universe is assumed. The different 
curves are for four different values of SIm- The heavy dashed line 
sloping down from right to left indicates the variation of the virial 
mass of MS1054 with Om (see Figure g). The horizontal dotted line 
indicates a one in ten chance that a cluster as massive as MS1054 
would be present in the EMSS survey. Note that only this occurs 
only for S!m = 0.15. 

Figures || and [l(] show the results of calculating N eX p 
for a Gaussian PDF and a A = 1.5 ^-PDF respectively. 
Each figure plots Afexp for four representative values of 
r^M- The heavy dashed line sloping down and to the left 
indicates how the virial mass of MS1054 changes with J7m 
(cf. § |5|). Where that line intersects the curve for a given 
Hm yields the predicted number of MS1054-like clusters in 
the EMSS. As Figure || shows, this number is quite small 
for all values of 12m > 0.25 in the Gaussian case. A clus- 
ter at z = 0.83, and as massive as the lensing, X-ray, and 



velocity data for MS1054 indicate that it is, is unlikely to 
have been found if the fluctuations are Gaussian, unless 

n M <o.2. 

The situation is significantly changed for a ?p\-PDF with 
A = 1.5. Figure shows that an MS1054-like cluster now 
has a better than 10% chance of being found for J7m ~ 0.4. 
Thus, nongaussian fluctuations allow a much larger value 
of flM to be consistent with the MS1054 data. 
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Fig. 10. — Same as the previous figure, except that now the ex- 
pected number of clusters is calculated using a PDF described by 
the ipx distribution, with A = 1.5. 

Because we are dealing with a sample of one, we cannot 
hope to estimate a value of A per se. It is, however, rea- 
sonable to derive an upper limit on A (i.e., a lower limit 
on the required degree of nongaussianity) as a function of 
Qm- To do so, we calculate, for each Om, the value of A 
required to yield M exp = 0.1, i.e., a one in ten chance that 
an MS1054-like cluster would be found in the EMSS for 
that value of J7m- The resultant value of A may be thought 
of as a 90% confidence level upper limit on A. The corre- 
sponding value of the T-statistic (see Figure ||) would then 
be a 90% confidence level lower limit. 

Figure [fl] shows the results of carrying out this calcula- 
tion, and thus summarizes the main results of this paper. 
The upper limit on A decreases from oo (Gaussian fluctu- 
ations) for f^M = 0.17, to ~ 100 for 17m = 0.20, to much 
smaller values for A £0.25. As the upper limit on A de- 
creases with increasing SIm > signifying increasing nongaus- 
sianity, the lower limit on T increases, from 1 (Gaussian 
fluctuations) at J7m = 0.17, to ~ 2 at VLm = 0.25, to £ 6 for 
J7m > 0.4. The required amount of nongaussianity rapidly 
increases with increasing SIm- This is ultimately a reflec- 
tion of of the later "freeze-out" time for fluctuation growth 
in higher density universes. 
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Fig. 11. — The upper panel shows the 90% confidence level up- 
per bound on A as a function of Cm- The lower panel shows the 
corresponding lower bound on T. These 90% bounds correspond to 
the amount of nongaussianity required for the expected number of 
MS1054-like clusters in the EMSS to be > 0.1. See main text for 
further details. 

7. DISCUSSION 

We have shown that the X-ray selected cluster MS 1054- 
03, at z — 0.83, adds to a small but growing body of ev- 
idence that the primordial density fluctuation field may 
be nongaussian. Specifically, the high mass of MS1054, 
£ 10 15 h~ 1 M Q , indicates that rare fluctuations, 6/<Tm > 3, 
are more probable at early times than they would be in 
the Gaussian case, if SIm ~ 0.2. In this concluding section, 
we further discuss several aspects of this result. 

7.1. Comparison with previous work 

7.1.1. Previous analyses based on MS1054 

Two other recent papers have considered the cosmolog- 
ical implications of MS1054: D98 (cf. § |) and Bahcall & 
Fan (1998; BF98). Both papers reached the conclusion 
that, by virtue of its high mass, MS1054 by itself rules out 
an Einstein-de Sitter universe at a high confidence level. 
D98 stated that MS1054 is consistent with a flat £!m = 0.3 
cosmology, while BF98 found that MS1054 is most con- 
sistent with Om = 0.2. Our results are in rough accord 
with theirs, although for Gaussian fluctuations we find 
f^M < 0.17, somewhat smaller than BF98 and markedly 
lower than D98. The latter difference may result from 
D98's conservative estimate of the virial mass of MS1054, 
~ 7x 10 14 /i~ 1 M , which assumed SIm = 1 and an isother- 
mal mass model. In any case, comparison with D98 and 
BF98 is imprecise because they did not consider nongaus- 
sianity as an additional degree of freedom in the abun- 
dance analysis; their constraints on Qm are valid only if 
the initial density field is Gaussian. 



7.1.2. Other cluster analyses allowing nongaussianity 

A more direct comparison of our results may be made 
with the recent papers by Robinson, Gawiser, & Silk (1998; 
RGS98) and Koyama, Soda, and Taruya (1999; KST99). 
The main points of those papers were summarized in § [If 
here we discuss them further in light of our findings. 

RGS98 and KST99 considered not only the cluster abun- 
dance, but also the cluster correlation length, using obser- 
vational constraints on the latter obtained from the APM 
survey (Croft et si. 1997), and compared with the predic- 
tions of the generalized PS formalism. While increasingly 
nongaussian PDFs increase the predicted cluster abun- 
dance (for given Qm and as), they decrease the predicted 
correlation length. This helps break the degeneracy be- 
tween S7m and nongaussianity (e.g., T) present in any anal- 
ysis which, like the present one, considers only abundance 
data. In particular, while our analysis allows Gaussian 
fluctuations for sufficiently low fi M , RGS98 and KST99 
found that large T is in fact required for low (^,0.2) £!m 
in order to accommodate both abundance and correlation 
length data. 

The analyses of RGS98 and KST99 differed in that only 
the latter used intermediate-redshift cluster abundance as 
an observational constraint (each considers z^Q.l abun- 
dances and correlations). This makes KST99 the more 
powerful probe of parameter space. While RGS98 found 
an Einstein-de Sitter universe with a Gaussian PDF to be 
a good fit to the data, KST99 ruled out n M > 0.5. They 
found that the required degree of nongaussianity is a mini- 
mum for n M ^ 0.3, at which T = 3.8±1.0. From Figure |l| 
we see that for — 0.3 our study indicates T > 3, consis- 
tent with the KST99 result. For a flat J7m = 0.3 universe 
with T = 0.20, RGS98 obtained T = 4.0t^, consistent 
with both KST99 and this paper. 

7.2. On the form of nongaussian PDF 

KST99 assumed P(5\M) oc Xm(^/ cr M), whereas we have 
taken P(S\M) oc ip\(5/a M )- The two PDFs differ more in 
spirit than in practice. The \m distribution has its roots 
in a physical model, in which the CDM is produced at the 
end of an inflationary epoch from m scalar fields which 
couple quadratically to a Gaussian inflaton field. The ip\ 
distribution, on the other hand, has no particular physical 
motivation, being simply a mathematical transformation 
of the familiar Poisson distribution (§ |J). As we showed in 
§ ||, the two distributions are remarkably similar in terms 
of their T-statistic, provided one makes the identification 
m = 9A. In fact, Xm=9\( x ) an< ^ V'aOe) are similar (though 
not identical) at all values of x, not only in their T values, 
for A > 3. 

Our reason for introducing the ip\ distribution is pri- 
marily to make the philosophical point that nongaussianity 
can be modeled phcnomcnologically, and that as a result 
we should be cautious in interpreting our findings as ev- 
idence of any particular physical model. Thus, although 
the cluster data point toward a PDF with T w 4, this 
does not imply that the quadratically coupled inflationary 
model with m ~ 40 is correct. Indeed, any theory which 
invokes such a large number of identical but independent 
scalar fields is suspect on Occam's Razor grounds alone. 
More to the point, any PDF with the indicated level of 
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nongaussianity — such as the ip\ distribution with A ~ 3 — 
can can account for the cluster data. 

Of course, even if the tp\ distribution proves able to de- 
scribe the cluster abundance data, this will not necessar- 
ily constitute evidence that it is a good description of the 
PDF. The cluster data test the positive tail of the PDF, 
not its shape near the peak, which can only be probed 
via statistics sensitive to regions of average density. The 
clustering of galaxies in the mildly nonlinear regime, in 
which perturbation theory (PT) is valid, may allow such 
a probe. Frieman & Gaztahaga (1999) have applied PT 
to the angular 3-point correlation function, and have com- 
pared their predictions to the APM data. They are able 
to rule out a strongly nongaussian PDF, the Peebles \ 2 
density field (cf. § [j]), which has T — 16.3, via this ap- 
proach. They did not, however, consider the milder levels 
of nongaussianity (T « 3-4) indicated by the cluster data 
for f^M — 0.3. Inspection of Figure |l| shows that at such 
levels (A ~ 5) the PDF is close to Gaussian near its peak; 
it may be quite difficult for the approach of Frieman & 
Gaztahaga to constrain nongaussianity at this level. An- 
other test of the PDF will be provided by CMB anisotropy 
data. As mentioned at the outset of the paper, analyses 
of the COBE data have unearthed indications of nongaus- 
sianity. However, those results correspond to a comoving 
scale of ~ lOOO/i -1 Mpc, much larger than the ^lO/i -1 
Mpc cluster scales probed by clusters. Future CMB mea- 
surements sensitive to 10' scale anisotropics will probe 
cluster mass scales directly. It is not yet clear, however, 
that the signal-to-noise ratio per resolution element will 
be sufficient to detect mild nongaussianity. It may well 
be that for the forseeable future the cluster data sets will 
provide the most sensitive tests of PDF nongaussianity. 

7.3. What does it mean? 

If future tests confirm that the initial density field is 
moderately nongaussian, what would this tell us about 
the early universe? It was argued above that this would 
not in and of itself lend credence to any particular infla- 
tionary model. Rather, the basic significance of such a 
finding would be that the fundamental mechanism leading 
to real-space Gaussianity of the initial fluctuations — their 
origin as superpositions of numerous, statistically indpen- 
dent individual Fourier modes (equation 1) is not fully 
operative. This could be because either (i) only a small 
number of Fourier modes contribute effectively to <5(x), 
and the mode amplitudes are themselves nongaussian; (ii) 
the various Fourier modes are not statistically indepen- 
dent; or (iii) some combination of (i) and (ii). It is the 
second of these effects which is responsible for nongaus- 
sianity in Peebles' (1999a, b) model, but the first cannot be 
excluded. Only as observational constraints on nongaus- 
sianity improve will be able to construct realistic models 
of how it arises. This may lead us to a deeper under- 
standing of the fluctuations themselves. Indeed, perhaps 
the greatest lesson to be learned from this paper and the 
literature cited herein is that we still know relatively lit- 
tle about the true origin and character of the primordial 
density fluctuations. 

7.4. Reasons for Caution 

As seen in Figures 9 and 10, the PS-predicted comoving 
density N(> M, z) is a rapidly decreasing function of mass 



for M £ 10 15 /i _1 M . Consequently, the predicted number 
of MS1054-like clusters is extraordinarily sensitive to the 
mass we assign to MS1054. We have selected this clus- 
ter because three high-quality data sets, LK97, D98, and 
T99, enable us to estimate its mass by three independent 
methods. However, Figure ^ shows that the velocity dis- 
persion mass estimate for MS1054 is about 50% below the 
X-ray and lensing estimates. This discrepancy is within 
the observational uncertainties and thus does not suggest 
the presence of major systematic errors. On the other 
hand, if the velocity dispersion mass estimate of T99 is 
correct, our conclusions would be markedly changed. We 
would in that case find, using the same criteria as above, 
that Gaussian fluctuations are allowed for Om < 0.33 as 
compared with f^M < 0.17, and that for J7m = 0.45 the 
90% constraint is A < 25 (T > 2) as compared with A < 1 
(T > 7). In short, our conclusion that substantial non- 
gaussianity is indicated by MS1054 is correct only if the 
X-ray and lensing mass estimates are closer to the truth 
than the dynamical one. 

This extreme sensitivity to mass is both the strength 
and weakness of the PS approach. It is a strength because 
it enables even one high-redshift, high-mass cluster such as 
MS1054 to powerfully constrain cosmology. It is a weak- 
ness because it means that even modest random errors in 
mass estimation drastically affect our quantitative conclu- 
sions. To best utilize the PS approach as a cosmologi- 
cal probe, we will need much larger high-redshift (z £ 0.6) 
cluster samples than are presently available. Each sample 
cluster will need to have its mass as accurately and ro- 
bustly measured as MS1054. Only then will the effect of 
mass errors be reduced, by \/~N statistics, to levels at which 
nongaussianity can be constrained (as a function of Qm) 
with high confidence. The EMSS cluster sample has been 
the major source of known intermediate and high redshift 
clusters to date, but its sky coverage is too small at the 
faintest X-ray flux levels to suffice for future work. Future 
distant cluster samples will most likely be derived from 
deep optical surveys to which automated cluster-finding 
algorithms (e.g., Postman et ai. 1996; Kepner et ai. 1998) 
are applied, with cluster candidates followed up with spec- 
troscopy at 8-10 m class telescopes, X-ray satellite obser- 
vations, and ground- or space-based deep imaging, to ob- 
tain dynamical, X-ray temperature, and weak lensing mass 
estimates respectively. 

The second reason for caution is our reliance on the PS 
formalism, whose validity at high masses is not yet con- 
firmed. It has long been known from N-body simulations 
that the PS formula overestimates the number of collapsed 
objects at low masses, M <C M*, where M* is the "non- 
linear mass" at a given epoch defined by o~m{M*) — S c (z). 
At masses M^,M*, however, N-body studies have found 
PS abundance predictions to be remarkably accurate (see, 
e.g., Borgani et ai. 1999 and references therein). The non- 
linear mass at the present time is ~ 5 x 10 13 /i _1 M Q , so 
that rich clusters are safely above M* and thus expected 
to be well-described by the PS formalism. However, the 
PS formula has not been exhaustively tested in the very 
high-mass (M }t 100M*) regime, for the simple reason that 
most N-body simulations contain very few objects in this 
mass range. 

With the recent advent of extremely large simulations, 
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such tests have become possible. In a set of simulations 
of cubic volumes ~500/i _1 Mpc on a side, each contain- 
ing hundreds of rich clusters, Governato et al. (1998) were 
able to test the accuracy of the PS abundance formula 
for virial masses up to ~ 3 x 1O 15 /i _1 M . They found 
excellent agreement between the predicted and observed 
number of clusters in simulations of open, Qm = 0.3-0.4 
universes. However, in their J7m = 1 simulation with a 
low present-day normalization (os ,$ 0.5), they found that 
the PS formula underpredicted the number of clusters by 
a factor of ~ 3-10 for masses greater than ~ 10 15 /i _1 M . 

The case for which Governato et al. found PS to be inac- 
curate, Qm — 1, is not one that is relevant to this paper (we 
considered only SIm < 0.5), and for low-density universes 
(the case of interest here) Governato et al. confirmed the 
PS abundance predictions. Still, these results are cause for 
concern, because if PS underpredicts abundances, we may 
be led to spurious evidence for nongaussianty. On balance, 
then, while the present evidence from N-body simulations 
favors continued use of the PS formalism for cluster anal- 
ysis, the Governato et al. findings suggest that continued 
testing with larger simulations is needed. 

7.5. Conclusion 

We have used the generalized PS formalism to calcu- 
late the likelihood that a cluster as massive as MS 1054- 
03, at a redshift of z = 0.83, would have been found in 
the EMSS sample. The calculations assumed a flat uni- 
verse, Qm + = 1, and mass fluctuations o~m normal- 
ized to the observed abundance of low-redshift (z < 0.1) 
clusters. The expected number of MS1054-like clusters 
then depends on Qm and the deviations from Gaussian- 
ity of the PDF, P(S\M), on cluster mass scales. We 
characterized departures from Gaussianity by assuming 
P(S\M) = cr M V A (J/M), where tp x (x) is defined by equa- 
tion 16. The parameter A quantifies the degree of non- 
gaussianity; ip\ approaches Gaussianity for A 3> 1. 

Special attention has been given to the problem of es- 
timating cluster virial masses from galaxy velocity, X-ray 
temperature, and weak lensing data. Such estimates are 
dependent on the assumed density profile of the cluster, 
for which we have adopted the NFW form, which has been 
shown in N-body simulations to be more realistic than any 
pure power law. We elected to work with MS1054 alone 
because quality X-ray temperature, galaxy velocity, and 
lensing data have been obtained for it (D98, T99, LK97), 
making it a uniquely well-studied high-redshift cluster at 
this time. The X-ray and lensing mass estimates are in 
excellent agreement; the dynamical mass is ~ 50% smaller 
than the other two, but this is within the observational 
errors. 

If the initial density fluctuations are Gaussian, it is im- 
probable that a cluster of the mass, redshift, and X-ray 
flux of MS1054 would be found in the EMSS if fi M > 0.2. 
For example, the chances of finding an MS1054-like clus- 
ter, for a Gaussian PDF, in the EMSS search volume is 
less than about one in fifty if fijyr = 0.25, and less than 
one in three hundred if Qm = 0.35, as shown in Figure 9. 
If the PDF is nongaussian, however, the likelihood can 
be greatly enhanced. To constrain the required amount of 
nongaussianity, we have determined the value of A required 
for an MS1054-like cluster to have a one in ten chance of 
being found in the EMSS. The results are shown in Fig- 



ure |ll| as a function of Qm- The maximum allowed value 
of A estimated using this criterion decreases from ~ 25 for 
Qm = 0.25 to ~ 1 for flu = 0.45. These results may also 
be expressed in terms of the parameter T (equation 20), 
which measures the likelihood of > 3tr peaks in the den- 
sity field relative to the Gaussian case. We find T > 2 for 
f^M = 0.25, increasing to T£6 for J7m~0.4. In short, for 
any value of flyi ~ 0.25 the initial density field must be sig- 
nificantly nongaussian, with the extent of nongaussianity 
increasing rapidly with increasing Qm- 

Because our analysis has considered only the predicted 
cluster abundance, it does not exclude Gaussian fluctua- 
tions if we are willing to accept Qm < 0.2. However, the 
case for a nongaussian PDF irrespective of Qm is strength- 
ened if our results are taken in conjunction with those of 
RGS98 and KST99, who analyzed cluster correlation as 
well as abundance data. In particular, KST99 found that 
nongaussianity was required, at the level T ~ 2-6, for all 
values of Qm less than 0.5, and ruled out larger values of 
Qm- Our estimate T > 3 for Qm > 0.3 is consistent with 
their results. If one takes the findings of RGS98, KST99, 
and this paper at face value, the case for nongaussian ini- 
tial density fluctuations is strong indeed. 

We have, however, identified two potential weaknesses 
in our anlaysis, and by extension with any attempt to 
constrain cosmological parameters from cluster abundance 
data. First, the predicted abundances drop precipitously 
with increasing cluster virial mass at the high masses 
(^lO 15 /i _1 Af ) and redshifts (£0.5) of interest. This 
sensitivity translates into large errors in the derived cos- 
mological parameters for even modest (~ 30%) errors in 
virial mass. This fundamental problem can only be reme- 
died by much larger catalogs of high-redshift massive clus- 
ters; at present, MS1054 is one of but a handful of such 
objects known. These clusters, once identified, will need 
to be followed up with X-ray, galaxy velocity, and weak 
lensing measurements to ensure reliable mass estimates. 
Efforts are presently under way by this author and collab- 
orators, as well as other groups, to obtain such data sets. 
In 5-10 years we will undoubtedly know much more than 
we do now about the evolution of the cluster abundance 
at z *~ 1. 

The second problem concerns the validity of the Press- 
Schechter formalism in the high-mass regime. It has be- 
come conventional wisdom in recent years that the PS 
abundance formula "works much better than it should," 
based as it is on a simple, spherical collapse model, and as 
a result it has been widely and fruitfully used. However, 
if the PS formula underpredicts the actual abundance of 
high-mass objects, as at least one study suggests (Gover- 
nato et al. 1998), one would underestimate Qm (if Gaus- 
sian fluctuations are assumed) or overestimate departures 
from Gaussianity by applying the PS formula to cluster 
abundance data. A challenge for theory and numerical 
simulations in the coming years is to rigorously test the PS 
formula at high masses and, if necessary, replace it with a 
more accurate semianalytical framework. Such theoretical 
groundwork will be crucially important if the high-quality 
cluster data sets of the coming decade are to be fully ex- 
ploited. 
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APPENDIX 
APPROXIMATING a M (R) 

In this appendix we obtain an approximation for um{R) that is valid for CDM models. Using equations 5 and 6 in the 
main text, and assuming for simplicity n = 1, we may write the mass variance as follows: 

o 2 M {R) = i 2 H (^ 5 ) I(TR), (Al) 



H R 



where 



X(rR) = J~ W^T 2 ^) dx. 



(A2) 



Equation Al further simplifies to 



In a M (R) = 16.012 + In S H + 0.5 lnX(TR) - 2 In R , (A3) 

when R is g iven in h^ 1 Mpc. 



Equation A3 shows that ctm(R) is fully determined, apart from a normalizing constant involving Sh, by ]nX(TR). 
Since we fix the normalization by the cluster abundance at low redshift, the 5u term is unimportant here, and a suitable 
expression for X(TR) is all we need to evaluate ctm(-R)- We have obtained such an expression by numerically evaluating 
X(TR) for a range TR and fitting the results to a low-order polynomial of the form 

m 

lnI(TR) = a n [ln(TR)] n . (A4) 

Choosing m = 5 pr oduced a fit with better than 0.1% accuracy for 1 < TR < 8, fully covering the range of expected 
values. In Table 



Al 



we give the coefficients yielding the best fifth-order fit. Substituting this polynomial expression into 
tm up to a normali 
PS abundance formula is then simply 

d In (Tm 1 



equation [A3] yields aM up to a normalizing coefficient. The expression for the logarithmic derivative of ctm needed for the 

; na n HER)]"- 1 - 2 . (A5) 



d\nR 2^ 



Table Al 

Fit Coefficients for Computing lnZ(r\R) 



a 


ai 


ai 


a>3 


0(4 


a 5 


-4.7783 


2.7251 


-0.1811 


-0.0232 


-0.0053 


0.0014 



Notes: Coefficients in the expansion defined by equation 



A4. 



